Expected salary: ₹18 - ₹28 lakhs per annum
Description:
Optum’s Applied AI team is looking for a Senior Data Scientist (core focus: data analysis) to help build and maintain the data pipelines that power large-scale AI/ML initiatives in healthcare.
In this role, you will work closely with data engineers, ML experts, and clinical domain specialists to prepare, analyze, and manage structured and unstructured healthcare datasets. Your expertise will directly enable AI-driven solutions that make healthcare more effective and accessible.
If you enjoy working with big data, building scalable pipelines, and solving complex challenges in AI/ML, this role offers you the opportunity to make an impact in a global Fortune 10 organization.
What you’ll do:
- Collaborate with ML engineers, annotators, and clinical experts to turn business problems into AI solutions.
- Design and maintain scalable data pipelines for AI/ML workflows.
- Implement automated data labeling pipelines using active learning, weak supervision, and human-in-the-loop methods.
- Perform exploratory data analysis (EDA) and data validation on healthcare datasets.
- Prepare structured (EHRs, EMR data) and unstructured (PDFs, clinical notes) data for model training and monitoring.
- Orchestrate data workflows with Apache Airflow or AWS Step Functions.
- Monitor data quality, drift, and consistency across multiple sources.
- Create dashboards and reports using tools like Power BI, Tableau, or Plotly to communicate pipeline health and insights.
- Ensure proper version control, documentation, and collaboration across the team.
Key Technical Skills:
Python (Advanced), SQL (Advanced), Data Pipelines, Airflow, Jupyter Notebook, Git, Power BI, Tableau, Plotly, AWS (basic exposure), Data Quality Monitoring, Excel (Advanced), NLP in Healthcare, Clinical Data Analysis, EMR/EHR Data
Requirements:
- Bachelor’s degree in Computer Science or a related field.
- Advanced degree in Data Science, Statistics, Applied Mathematics, or related areas (preferred).
- Minimum 4 years of experience in Data Science, specifically in data analysis and pipeline development.
- Strong programming skills in Python and SQL.
- Experience in the healthcare and NLP domains (clinical data, EMRs, coding workflows).
- Knowledge of data quality monitoring techniques.
- Familiarity with the AWS ecosystem.
- Excellent communication skills to present insights to technical and non-technical teams.
- Flexibility to work during critical business periods and willingness to adapt to varying shifts.
- Proven team player with strong collaboration skills.
Preferred:
- Advanced hands-on experience with visualization tools like Power BI, Tableau, or Plotly.
- Prior work with data drift detection and automation.
- Experience handling large-scale healthcare datasets.
Important Notice:
This job description and related content are owned by Optum. We are only sharing this information to help job seekers find opportunities. For application procedures, status, or any related concerns, please contact Optum directly. We do not process applications or respond to candidate queries.